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AGGCTGTCCCACCCACCATCTGCACCCGCTGCAGCGCCCGCGCCCCTGTCCCGCGCCGTAGTCGTCATTTGTAGCCC 
GCCTGCCGCTCCCGGGGACCCGATCCTACCCTGGGTGCGGGGCAGAGCGGGCATGGCCCGTCTGGGGACCGCCTGCC 
CTGCGCTGGCGCTGGCCCTGGCACTTGTGGCGGTGGCCCTGGCTGGAGTCAGAGCCCAGGGCGCAGCCTTCGAGGAG 
CCTGACTATTACAGCCAGGAGCTCTGGCGGCGCGGGCGCTATTATGGGCATCCGGAGCCTGAGCCGGAGCCGGAGCT 
CTTCTCGCCTTCAATGCATGAAGACCTTAGGGTGGAGGAGCAGGAACAGCAGGAGCCGCACCAGCAGGGCCACAGAA 
CTCCCAAGAAGGCCATCAAGCCCAAGAAGGCTCCCAAGAGGGAGAAGTTAGTTGCAGAGACGCCTCCACCAGGTAAA 
AATAGCAACAGAAAAGGCAGAAGAAGCAAGAATCTTGAGAAAGCTGCCAGTGATGACCATGGTGTCCCTGTGGCTCA 
TGAGGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGAAACATTAAAAATCACAGACTTCCAGCTGCATGCCTCCA 
CATCGAAGCGTTATGGCCTGGGAGCCCACCGGGGGAGACTCAACATCCAGGCAGGCATTAATGAAAATGACTTTTAC 
GATGGGGCTTGGTGTGCTGGTAGGAACGACTTGCATCAGTGGATCGAAGTGGATGCCCGGCGCCTGACCAAGTTCAC 
AGGGGTCATTACCCAAGGAAGGAACTCTCTCTGGCTGAGTGACTGGGTGACATCCTATAAAGTCATGGTGAGCAATG 
ACAGCCACACATGGGTTACTGTGAAGAATGGATCTGGCGACATGATATTTGAAGGAAACAGTGAGAAGGAGATTCCT 
GTGCTCAATGAGCTGCCAGTCCCCATGGTGGCCCGCTACATTCGCATAAACCCTCAGTCCTGGTTTGATAACGGGAG 
CATCTGCATGAGGATGGAGATCTTGGGCTGCCCACTGCCGGATCCTAATAACTATTATCACCGACGTAATGAGATGA 
CCACCACGGATGACCTGGATTTTAAGCACCACAACTATAAGGAAATGCGCCAGTTGATGAAGGTTGTCAATGAAATG 
TGCCCCAATATTACCAGGATTTACAACATTGGCAAAAGCCACCAGGGCCTGAAATTGTATGCGGTAGAGATCTCTGA 
CCATCCTGGGGAACATGAAGTTGGTGAGCCCGAGTTCCACTACATCGCAGGGGCCCACGGCAATGAGGTTCTGGGAC 
GAGAACTGCTGCTGCTGCTGCTGCACTTCCTCTGCCAGGAATACTCGGCGCAGAACGCACGCATCGTCCGCTTGGTG 
GAGGAGACTCGAATCCACATTCTACCCTCCCTCAATCCTGATGGCTATGAGAAGGCCTATGAAGGAGGTTCCGAGTT 
GGGAGGCTGGTCCCTGGGACGTTGGACCCATGATGGCATCGATATCAACAACAACTTTCCGGATTTAAACTCGCTGC 
TCTGGGAGGCAGAGGACCAGCAGAATGCCCCAAGGAAGGTCCCCAACCACTACATTGCCATCCCTGAGTGGTTTCTG 
TCTGAGAATGCCACAGTGGCCACAGAGACCAGAGCCGTCATCGCCTGGATGGAGAAGATCCCGTTTGTGCTGGGAGG 
CAACCTACAGGGGGGTGAGCTGGTCGTGGCATACCCCTATGACATGGTGCGGTCCCTGTGGAAGACCCAGGAGCACA 
CCCCAACACCTGATGATCATGTGTTCCGCTGGCTGGCGTATTCCTACGCCTCCACTCACCGCCTCATGACAGATGCC 
AGGAGGCGAGTGTGCCACACGGAAGATTTTCAGAAGGAGGAGGGCACCGTCAATGGGGCTTCCTGGCACACAGTGGC 
TGGAAGTCTAAACGATTTCAGCTACCTCCATACAAACTGCTTTGAGCTGTCCATCTACGTGGGCTGTGATAAATACC 
CACACGAGAGCGAGCTGCCGGAGGAATGGGAGAATAACCGGGAGTCTCTGATTGTGTTCATGGAGCAGGTTCATCGA 
GGCATCAAAGGCATAGTGAGAGATTTACAAGGGAAAGGGATTTCAAATGCTGTCATCTCTGTGGAAGGTGTTAACCA 
TG AC ATC C GG AC AGC C AGC G ATGGGG ATT AC TGGC GTC T AC TG AAC C C TGGC G AAT ATGTGGTC AC AGC C AAGGC GG 
AAGGCTTTATCACTTCCACCAAGAACTGCATGGTTGGCTATGATATGGGAGCTACTCGGTGTGACTTCACCCTCACA 
AAGACC AAC C TGGC TAGGATAAGAGAAATTATGGAGACATTTGGGAAGC AGC CTGTCAGCCTACCCTCCAGGCGCCT 
GAAGCTGCGGGGACGGAAAAGGCGGCAGCGTGGGTGACCCTGTCGGACACTTGAGACATACCCCAGACCGTGCAAAT 
AAAAATC C AC TC C AGT AGT AAAAAAA 
(SEQ ID N0:1) 



MARLGTACPALALALALVAVALAGVRAQGAAFEEPDYYSQELWRRGRYYGHPEPEPEPELFSPSMHEDLR 

VEEQEQQEPHQQGHRTPKKAIKPKKAPKREKLVAETPPPGKNSNRKGRRSKNLEKAASDDHGVPVAHEDV 

RESCPPLGLETLKITDFQLHASTSKRYGLGAHRGRLNIQAGINENDFYDGAWCAGRNDLHQWIEVDARRL 

TKFTGVITQGRNSLWLSDWTSYKVMVSNDSOT 

INPQSWFDNGSICMRMEILGCPLPDPNNY^ 

NIGKSHQGLKLYAVEISDHPGEHEVGEPEFHYIAGAHGNEVLGRELLLLLLHFLCQEYSAQNARIVRLVE 

ETRIHILPSLNPDGYEKAYEGGSELGGWSLGRWTHDGIDINNNFPDLNSLLWEAEDQQNAPRKVPNHYIA 

IPEWFLSENATVATETRAVIAWMEKIPFVLGGNLQGGELWAYPYDMVRSLWKTQEHTPTPDDHVFRWLA 

YSYASTHRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHESEL 

PEEWENNRESLIVFMEQVHRGIKGIVRDLQGKGISNAVISVEGVNHDIRTASDGDYWRLLNPGEYVVTAK 

AEGFITSTKNCMVGYDMGATRCDFTLTKTNLARIREIMETFGKQPVSLPSRRLKLRGRKRRQRG (SEQ 

ID NO: 2) 
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underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



[AGGCTGTCCCACCCACCATCTGCACCCGCTGCAGCGCCCGCGCCCCTGTCCCGCGCCGT 
AGTCGTCATTTGTAGCCCGCCTGCCGCTCCCGGGGACCCGATCCTACCCTGGGTGCGGGG 
CAGAGCGGGCATGGCCCGTCTGGGGACCGCCTGCCCTGCGCTGGCGCTGGCCCTGGCACT 
TGTGGCGGTGGCCCTGGCTGGAGTCAGAGCCCAGGGCGCAGCCTTCGAGGAGCCTGACTA 
TTACAGCCAGGAGCTCTGGCGGCGCGGGCGCTATTATGGGCATCCGGAGCCTGAGCCGGA 
GCCGGAGCTCTTCTCGCCTTCAATGCA] TGAAGACCTTAGGGTGGAGGAGCAGGAACAGC 
AGGAGCCGCACCAGCAGGGCCACAGAACTCCCAAGAAGGCCATCAAGCCCAAGAAGGCTC 
CCAA [ GAGGGAGAAGTTAGTTGCAGAGACGCCTCCACCAG] GTAAAAATAGCAACAGAAA 
AGGCAGAAGAAGCAAGAATCTTGAGAAAGCTGCCAGTGATGACCATGGTGTCCCTGTGGC 
TCATGAGGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGAAACATTAAAAATCACAGA 
CTTCCAGCTGCATGCCTCCACATCGAAGCGTTATGGCCTGGGAGCCCACCGGGGGAGACT 
C AAC ATC C AGGC AGGC ATT AATGAAAATG AC TTTT AC G ATGGGGC TTGGTGTGC TGGT AG 
GAACGACTTGCATCAGTGGATCGAAGTGGATGCCCGGCGCCTGACCAAGTTCACAGGGGT 
CATTACCCAAGGAAGGAACTCTCTCTGGCTGAGTGACTGGGTGACATCCTATAAAGTCAT 
GGTGAGCAATGACAGCCACACATGGGTTACTGTGAAGAATGGATCTGGCGACATGATATT 
TGAAGGAAACAGTGAGAAGGAGATTCCTGTGCTCAATGAGCTGCCAGTCCCCATGGTGGC 
CCGCTACATTCGCATAAACCCTCAGTCCTGGTTTGATAACGGGAGCATCTGCATGAGGAT 
GG AG ATC TTGGGC TGC C C AC TGC C GG ATC C T AAT AAC T ATT ATC AC C G AC GT AATG AG AT 
G AC C AC C AC GG ATGACC TGGATTTTAAGCACC AC AAC T AT AAGG AAATGC GC C AGTTG AT 
GAAGGTTGTCAATGAAATGTGCCCCAATATTACCAGGATTTACAACATTGGCAAAAGCCA 
C C AGGGC C TG AAATTGT ATGCGGT AGAG ATC TCTGAC CATC C TGGGG AAC ATG AAGTTGG 
TGAGCCCGAGTTCCACTACATCGCAGGGGCCCACGGCAATGAGGTTCTGGGACGAGAACT 
GCTGCTGCTGCTGCTGCACTTCCTCTGCCAGGAATACTCGGCGCAGAACGCACGCATCGT 
CCGCTTGGTGGAGGAGACTCGAATCCACATTCTACCCTCCCTCAATCCTGATGGCTATGA 
GAAGGCCTATGAAGGAGGTTCCGAGTTGGGAGGCTGGTCCCTGGGACGTTGGACCCATGA 
TGGCATCGATATCAACAACAACTTTCCGGATTTAAACTCGCTGCTCTGGGAGGCAGAGGA 
CCAGCAGAATGCCCCAAGGAAGGTCCCCAACCACTACATTGCCATCCCTGAGTGGTTTCT 
GTCTGAGAATGCCACAGTGGCCACAGAGACCAGAGCCGTCATCGCCTGGATGGAGAAGAT 
CCCGTTTGTGCTGGGAGGCAACCTACAGGGGGGTGAGCTGGTCGTGGCATACCCCTATGA 
CATGGTGCGGTCCCTGTGGAAGACCCAGGAGCACACCCCAACACCTGATGATCATGTGTT 
CCGCTGGCTGGCGTATTCCTACGCCTCCACTCACCGCCTCATGACAGATGCCAGGAGGCG 
AGTGTGCCACACGGAAGATTTTCAGAAGGAGGAGGGCACCGTCAATGGGGCTTCCTGGCA 
C AC AGTGGC TGG AAGTC TAAAC GATTTC AGC T AC C TC C ATAC AAACTGC TTTGAGC TGTC 
CATCTACGTGGGCTGTGATAAATACCCACACGAGAGCGAGCTGCCGGAGGAATGGGAGAA 
TAACCGGGAGTCTCTGATTGTGTTCATGGAGCAGGTTCATCGAGGCATCAAAGGCATAGT 
GAGAGATTTAC AAGGGAAAGGGATTTC AAATGC TGTC ATC TCTGTGGAAGGTGTT AAC C A 
TGACATCCGGACAGCCAGCGATGGGGATTACTGGCGTCTACTGAACCCTGGCGAATATGT 
GGTCACAGCCAAGGCGGAAGGCTTTATCACTTCCACCAAGAACTGCATGGTTGGCTATGA 
TATGGGAGCTACTCGGTGTGACTTCACCCTCACAAAGACCAACCTGGCTAGGATAAGAGA 
AATTATGGAGACATTTGGGAAGCAGCCTGTCAGCCTACCCTCCAGGCGCCTGAAGCTGCG 
GGGACGGAAAAGGCGGCAGCGTGGGTGACCCTGTCGGACACTTGAGACATACCCCAGACC 
GTGC AAAT AAAAATC C ACTC C AGT AGT AAAAAAA (SEQ ID N0:1) 
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Gene Sequence Structure 



Size of full-length 
cDNA: 2490 bp 



327 bp 



Sequence Deleted 



422 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 2825 



Arm Length: 
5': 4 kb 
3': 1.5 kb 



5' arm 



LacZ-Neo 
Cassette 



3' arm 




Targeting Vector 
Endogenous Locus 



Not drawn to scale 



5 ' >GGCATGGCCCGTCTGGGGACC 
GC C TGC C C TGC GC TGGC GC TGGC C 
C TGGC AC TTGTGGC GGTGGC C C TG 
GC TGG AGTC AG AGC C C AGGGC GC A 
GC CTTC G AGG AGCC TG AC T ATT AC 
AGCCAGGAGCTCTGGCGGCGCGGG 
CGCTATTATGGGCATCCGGAGCCT 
GAGCCGGAGCCGGAGCTCTTCTCG 
CCTTCAATGCA<3 1 
(SEQ ID NO: 3) 



5 • >GAGGGAGAAGTTAGTTGCAGA 
G AC GC C TC C AC C AGGT AAC TTTTG 
CATCGGGCAGCCCGAGGGGGCGCC 
AGC G ATC GTGGC AC TC C AGGGG AC 
AC C TGGC TTCC AGT ATGTTTTC TT 
G AGTG AGC CC AGC C AAAGTC C TGT 
GGTGC C TGTGTT ATTC C C TAG AG A 
C T AC ATC TG AGC T AAGTTC AGC TT 
TCTCTCCCTGCO' 
(SEQ ID NO: 4) 
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